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SUMMARY 

The  handling  of  storage  assignment  and  subroutine  Inter¬ 
connections  for  the  704  linear  programming  codes  are  dis¬ 
cussed.  The  available  storage  of  the  machine  Is  divided  Into 
two  main  parts  --  code  and  data.  Their  layouts  are  fairly 
Independent  of  one  another  and  the  programs  Include  routines 
for  automatic  "housekeeping"  during  loading  and  running  of  a 
Job.  Flexibility  Is  provided  for  systematic  modifications  in 
the  programs  or  for  machines  with  various  storage  facilities. 
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I  have  twc  reasons  for  discussing  the  storage  allocation 
that  we  used  In  the  coding  of  the  Linear  Programming  problem. 
The  foremost  reason  Is  to  Increase  your  understanding  of  the 
operation  of  the  LP  code,  and  secondly,  I  think  It  Is  an 
excellent  solution  of  the  storage  allocation  problem  for  any 
big  general  program.  The  704  code  for  solving  the  Transporta¬ 
tion  problem,  which  I  worked  on  In  New  York, solves  the 
Transportation  problem  admirably,  but  It  lacks  the  tidiness 
that  a  good  storage  layout  provides. 

To  begin  with,  the  LP  code  was  designed  for  a  704  with 
4090  words  of  magnetic  core  storage,  one  drum  unit,  five 
tape  units,  a  card  reader,  a  printer-,  and  a  punch.  Only  four 
tape  units  are  necessary  If  no  off-line  printing  Is  desired. 
The  code  has  already  been  modified  to  teike  advantage  of  the 
additional  4090  words  of  core  storage  which  has  been  added  to 
some  of  the  704 's  now  using  the  LP  code.  A  more  extensive 
modification  Is  planned  for  the  32,000  word  core  storage  unit. 

The  assembly  of  Input  data  Is,  at  present,  completely 
separated  both  logically  and  physically  from  the  LP  main  code. 
The  data  assembly  program  Is  made  up  of  many  subroutines  to 
accommodate  the  different  options  which  are  allowed.  For 
example,  the  data  may  be  loaded  on  a  tape  via  the  peripheral 
card-to-tape  equipment  Instead  of  the  card  reader.  The  Data 
Assembly  code  contains  the  necessary  subroutines  to  accept 
the  data  either  way.  The  layout  Is  somewhat  similar  In 
structure  to  the  main  code  since  we  wanted  to  use  the  same 
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data  and  constants  regions.  Since  the  Data  Assembly  code  did 
not  fit  into  the  assigned  space  in  high-speed  storage,  we 
broke  it  in  half  storing  the  second  half  on  a  drum.  After  the 
first  half  is  executed,  the  second  half  is  read  in  from  drums 
overwriting  the  first  half.  Control  is  then  returned  to  the 
beginning  of  the  code  and  the  second  half  is  executed.  This 
type  of  sleight  of  hand  is  very  common  where  space  is  short 
and  programs  are  long.  It  works  best,  however,  when  the  pro¬ 
gram  need  be  executed  only  once  before  it  is  destroyed,  as  in 
this  case. 

For  the  main  code,  high-speed  storage  is  divided  up  as 
follows  during  the  execution  of  the  code; 


0--  15 

comoN 

16 

16-  391 

MASTER  CONTROL  ROUTINES  (NCR) 

376 

392--  487 

UNIVERSAL  SUBROUTINES 

(DPFADD, 

488--  703 

DPFMUL,  DISTRB) 

96 

SUBROUTINES 

216 

704--  767 

UNIVERSAL  CONSTANTS 

64 

768-- 

H-REQION 

M+1 

-- 

V-REOION 

2N^2 

-- 

W-REQION 

2Nf2 

2048--4096 

T-REOION 

2048 

COMMON  region  is  used  for  erasable 

storage 

by  any  MCR 

or  subroutine  at  any  time.  The  NCR  region  is  used  by  the 
three  different  Master  Control  Routines  which  we  have  at  the 
present  time.  Only  one  of  these  may  be  operating  at  a  given 
time.  In  normal  operation,  the  COMMON  NCR  is  used  until  an 
optimal  solution  is  reached,  or  until  the  operator  decides  to 
stop  and  invert  the  present  basis.  The  INVERSION  NCR  must 
then  be  read  into  the  NCR  region  from  binary  cards  to  perform 
the  inversion.  After  the  inversion  is  completed,  the  COMMON 
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MCR  would  be  read  In  again  from  binary  cards  and  the  computa¬ 
tion  could  proceed.  This  manual  switching  of  the  MCR  codes 
may  be  eliminated  by  saving  the  MCR's  Internally,  but  at  the 
time  these  MCR's  are  loaded.  It  may  also  be  desirable  to  load 
part  or  all  of  the  data  In  again  since  the  restarts  are 
handled  In  this  way.  In  the  near  future  some  changes  may  be 
made  In  regard  to  the  way  restarts  are  handled  so  the  MCR's 
will  be  saved  Internally  for  more  automatic  changing. 

There  are  three  universal  subroutines,  namely.  Double 
Precision  Floating  Point  Add,  Double  Precision  Floating  Point 
Multiply,  and  a  subroutine  called  the  Distributor.  The 
functions  of  the  first  two  are  adequately  explained  by  their 
titles.  They  are  always  available  for  use  by  any  of  the  sub¬ 
routines  (hence,  they  might  be  more  aptly  described  as  sub- 
subroutines)  .  The  Distributor  is  a  very  short  subroutine 
which  contains  a  table  of  the  number  of  Instructions  and  the 
drum  address  of  the  first  Instruction  of  each  subroutine  In 
the  code  (with  the  exception  of  the  Universal  Subroutines). 

At  present,  t.here  are  l4  subroutines  which  may  be  used  by  the 
maiiter  codes.  These  subroutines  do  a  great  deal  more  than  the 
ordinary  macro-instruction  type  subroutines.  The  pricing 
operation,  for  example.  Is  a  single  subroutine.  When  the  NCR 
wants  to  execute  this  subroutine.  It  is  only  necessary  to 
execute  the  two  Instructions 

CLA  PRICE 

TSX  DISTRB,  C 
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The  first  Instruction  places  the  code  number  corresponding 
to  the  desired  subroutine  In  the  accumulator  register  sc  the 
Distributor  can  select  the  proper  drum  address  from  the  table. 
The  second  Instruction  places  the  2's  complement  of  Its  ovm 
location  In  Index  register  C  and  transfers  control  to  the 
Distributor.  First,  the  Distributor  saves  the  contents  of 
Index  register  C  In  a  standard  location.  Then  the  drum 
address  of  the  first  Instruction  and  total  number  of  Instruc¬ 
tions  In  the  subroutine  are  taken  from  the  table.  The 
Distributor  reads  the  subroutine  from  the  drum  Into  the 
standard  subroutine  region  and  transfers  control  to  the  first 
Instruction  of  that  region  (488).  The  subroutine  Is  then 
executed.  Upon  completion  of  the  subroutine.  Index  register 
C  Is  reloaded  from  the  standard  location  with  the  2's  comple¬ 
ment  of  the  location  of  the  TSX  Instruction  In  the  MCR. 

A  TRA  1,C  Is  executed  which  returns  control  to  the  Instruction 
following  the  TSX  Instriictlon  In  the  MCR.  In  the  704,  the 
effective  address  of  a  tagged  TRA  Instruction  Is  computed  by 
subtracting  the  contents  of  the  Indicated  Index  register  from 
the  stated  address  In  the  Instruction.  The  Index  registers 
do  not  have  an  associated  sign,  hence,  the  subtraction  Is  done 
by  taking  the  2's  complement  of  the  number  In  the  Index 
register  and  adding  It  to  the  stated  address.  The  2's  comple¬ 
ment  of  the  2's  complement  Is  the  number  Itself.  In  the  ex¬ 
ample  above,  we  added  one  to  the  original  location  and  thus 
returned  control  to  the  Instruction  following  the  TSX 
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Instructlon  In  the  MCR.  If  you  are  not  confused  by  now,  the 
rest  will  be  easy . 

Since  It  Is  possible  to  move  10,000  Instructions  per 
second  between  drum  and  high-speed  storage,  very  little  time 
is  used  by  keeping  all  of  the  subroutines  on  drum  and  calling 
them  in  fresh  each  time  they  are  to  be  executed.  Thus,  all  of 
the  ''heavy"  work  was  relegated  to  subroutines  while  decision 
making  and  control  remained  to  the  NCR's.  Thus,  to  call  a 
subroutine  off  the  drum  and  execute  It  requires  only  two  In¬ 
structions  In  the  MCR,  the  Distributor  routine,  and  a  very 
short  amount  of  time.  Each  of  the  subroutines  must  necessar¬ 
ily  make  certain  assumptions  concerning  the  disposition  of  the 
data  which  It  uses.  It  Is  the  prime  function  of  the  PICR's  to 
control  the  various  activities  and  flow  of  data  using  the  sub¬ 
routines  like  the  powerful  tools  they  are. 

In  the  space  allotted,  you  will  note  that  a  subroutine 
cannot  exceed  21b  words  In  length.  Later  on,  I  will  show  how 
we  can  fudge  this  number  up  a  bit,  but  first  I  want  to  point 
out  the  storage  for  constants  and  data.  The  Universal 
Constants  contain  the  numbers  0,  1,  2,  and  other  commonly  used 
Integers.  They  also  Include  all  of  the  standard  locations  for 
parameters  to  Indicate  which  phase,  which  stage,  which  cap¬ 
tion  the  print  program  will  use,  etc.  Some  of  these  constarits 
are  fixed  by  the  Data  Assembly  program  to  define  the  storage 
layout  for  the  particular  problem  being  run  such  as  M^l,  2N^2, 
VORION,  WORIQN,  etc. 


The  H-REOION  always  contains  wox*ds  and  Is  used  to 
keep  the  names  of  the  current  activities  which  constitute  the 
basis.  The  V  and  W  regions  are  2N4‘2  words  In  length  and  each 
may  contain  a  double-precision  vector.  The  solution  vector. 
Beta,  Is  normally  kept  in  V-reglon  from  where  It  Is  used.  The 
W-reglon  Is  used  for  working  storage  In  generating 

while  working  with  the  transformation  vectors  In 
T-reglon.  It  Is  used  for  several  purposes,  all  of  which 
require  an  expanded  double  precision  vector.  The  transforma¬ 
tion  vectors,  Eta^,  are  kept  In  condensed  form  In  T-reglon  when 
they  are  being  used.  At  other  times,  they  are  stored  on  drum 
3,  or  transferred  to  tape  (end-of -stage  procedure)  if  they 
exceed  the  capacity  of  T-reglon.  The  only  difference  between 
the  code  for  the  4096  and  the  8192  word  core  storage  machines 
Is  that  T-reglon  is  expanded  to  half  of  2046  plus  4096,  that 
Is  from  2048  to  3072.  Drum  3  Is  not  used  at  present  In  the 
8192  word  machine.  In  place  of  using  the  drum  for  temporary 
storage  for  the  Eta  vectors,  the  high-speed  storage  following  1 
T-reglon  Is  used.  This  change  has  caused  a  considerable 
Increase  In  speed  over  the  4096  word  machine.  The  T-reglon  , 

Is  also  used  for  temporarily  holding  the  matrix,  or  as  much  < 

of  It  as  possible,  when  It  Is  used. 

The  tapes  are  used  for  permanent  storage  of  the  matrix 
(tape  number  3)f  for  permanent  storage  of  the  transformation 
vectors  (tapes  numbered  2,  3,  and  4),  and  tape  6  is  used  as 
output  by  the  print  program  If  peripheral  printing  Is  desired. 
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Three  tapes  are  needed  for  the  transformation  vectors  for 
reliability  and  because  they  are  used  In  both  a  forward  and 
a  baclcward  direction.  Tape  2  contains  the  vectors  In  the 
order  In  which  they  were  generated  providing,  of  course,  that 
they  exceed  the  capacity  of  T-reglon.  Tapes  3  and  4  are  used 
alternately  to  contain  the  transformation  vectors  In  backward 
direction  with  respect  to  records  only.  That  Is,  at  the  first 
end-of -stage,  the  record  Is  written  on  tapes  2  aind  3*  At  the 
second  end-of -stage,  the  new  record  Is  added  to  tape  2  and 
written  on  tape  4.  Then  the  contents  of  tape  3  are  written 
on  tape  4  following  the  new  record.  Tapes  3  and  4  are  alter¬ 
nated  In  this  way  as  long  as  necessary. 

The  drum  layout  la  straightforward  and  flexible  enough  so 
that  It  can  be  changed  around  If  desired.  There  are  four 
logical  drums  in  a  drum  unit  of  the  704  with  2048  words  each. 
The  first  logical  drum  Is  selected  by  the  drum  load  button 
sequence  which  we  use  extensively  for  restart  procedures. 

Thus,  the  program  labeled  DRBOOT,  for  drum  bootstrap,  has  to 
be  placed  at  the  beginning  of  drum  1. 

The  last  2Hf2  words  of  drum  1  have  been  arbitrarily  chosen 
to  keep  b^,  the  original  right-hand  side.  All  of  the  other 
data  Including  the  current  MCR  and  constants  are  saved  on 
drum  1  for  the  automatic  restart.  Drum  2  and  the  rest  of  drum 
1  are  used  for  sub-MCR's  and  all  cf  the  subroutines.  Drum  3 
Is  used  for  storing  transformation  vectors  when  T-reglon  is 
being  used  for  other  purposes  except  when  using  an  8192  word 


P-907 

machine.  Drum  4  Is  used  for  other  vectors  generated  during 
the  runs  which  must  be  temporarily  saved  during  the  parametric 
programming. 

Now  I  would  like  to  show  the  way  In  which  we  are  able 
to  cheat  on  the  rigid  storage  layout  of  both  the  MCR  region 
and  the  subroutine  region.  The  Inversion  MCR  exceeds  the  376  , 
word  limitation  on  the  size  of  the  NCR's.  Here  again,  we  use 
the  sleight  of  hand  technique  of  loading  a  part  of  the 
Inversion  NCR  as  a  subroutine  with  a  code  of  zero.  The  main 
part  of  the  Inversion  MCR  calls  In  this  sub-NCR  via  the  dis¬ 
tributor.  This  sub -MCR  knows  a  priori  where  the  MCR  gets 
stored  on  drum  2  so  that  It  calls  It  In  to  the  MCR  region 
overwriting  itself  after  It  has  finished  Its  function.  I 

The  program  which  loads  the  subroutines,  the  sub-MCR's, 
and  the  current  MCR  Is  known  as  the  executive  loader  (EXECLD). 
This  is  a  standard  binary  loading  program  which  has  been 
modified  for  our  special  needs.  The  standard  binary  loader 
reads  cards  and  stores  the  words  in  storage  getting  the 
number  of  words  on  each  card  and  the  loading  address  of  the 
first  Instruction  from  the  left  word  In  the  9's  row  on  the 
card.  A  hash  check  sum  of  the  card  Is  In  the  right  word  In 
the  9' 5  row.  Before  It  stores  the  Instructions  away.  It  teats 
the  word  count  to  see  If  It  Is  zero.  When  It  finds  a  card 
with  a  count  of  zero.  It  uses  this  Indication  to  stop  loading 
and  transfers  control  to  the  location  Indicated  In  the  address 
part  of  the  left  woixl  In  the  9*s  row. 
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The  executive  loader  uses  the  same  indication,  a  count  of 
zero,  and  applies  a  detailed  examination  of  the  right  word  In 
the  9' 3  row.  Por  clarification,  a  card  with  a  word  count  of 
zero  is  called  a  TITLE  card  instead  of  a  transfer  card  In  our 
write-ups.  The  number  In  the  9' a  right  row  of  the  title  card 
Indicates  whether  a  subroutine,  sub-MCR  or  MCR  has  Just  been 
loaded . 

While  the  executive  loader  Is  loading  these  sub-MCR 'a  and 
subroutines.  It  stores  them  away  first  on  drum  2  and  then  on 
drum  1  when  drum  2  Is  full.  It  keeps  track  of  the  first  and 
last  loading  address  of  each  code  being  loaded  between  suc¬ 
cessive  title  cards.  Thus  It  computes  the  exact  number  of 
Instructions  In  the  routine  which  It  adds  to  a  counter  to 
keep  track  of  the  space  left  on  the  drum  which  It  Is  loading. 
Prior  to  loading  each  routine  It  places  a  return  Jump 
(TRA  1,  C)  In  the  first  location  of  the  MCR  region.  This  Is 
automatically  destroyed  by  loading  MCR’s  and  also  by  some  sub¬ 
routines  since  we  use  another  device  for  increasing  the  total 
length  of  subroutines.  This  is  to  put  a  preface  to  some  sub- 
I’outlnes  In  the  MCR  region  and  execute  this  set  of  code 
before  the  executive  loader  tucks  the  subroutine  away  on  a 
drum.  The  preface  Is  used  to  preset  those  addresses  In  each 
subroutine  which  depend  on  the  particular  problem  the  code  Is 
working  on.  You  may  recall  that  M  Is  variable  for  each  prob¬ 
lem  and  determines  the  limits  of  H,  V,  and  W  regions.  Thus 
the  executive  loader  always  transfers  control  to  the  first 


instruction  of  the  MCR  where  the  preface  Is  executed.  If 
there  Is  one,  and  control  returns  to  the  executive  loader 
after  the  subroutine  Is  initialized.  Then  the  executive 
loader  ignores  the  preface  In  the  MCR  region  and  tucks  the 
condensed  subroutine  away  on  the  drum,  places  the  number  of 
instructions  and  first  drum  location  \n  the  table  of  the  Dis¬ 
tributor  routine  and  proceeds  to  load  the  next  subroutine  or 
MCR.  Thus,  the  order  of  loading  Is  determined  to  be: 

BASE  routines  (DRBOOT,  Universal  sub-routines, 

EXECLD,  the  CONSTANTS) 

SUBROUTINES 

SUB-MCR's 

MCR 

DATA 

The  Inherent  beauty  In  this  loading  scheme  Is  that  the  program¬ 
mer  does  not  have  to  know  how  many  instructions  are  In  each 
subroutine,  sub-MCR,  or  MCR  so  long  as  the  region  boundaries 
are  not  violated.  Again,  we  have  allowed  the  704  to  do  the 
housework  of  keeping  track  of  the  number  of  Instructions  In 
each  subroutine,  store  them  away  on  drums  efficiently,  and 
bring  them  back  again  when  they  are  called  for  by  the  MCR. 

The  print  subroutine  exceeds  the  subroutine  storage  so 
badly  that  we  have  a  pseudo-subroutine  In  Its  place  and  the 
real  print  program  Is  loaded  like  a  sub-MCR.  The  pseudo- 
subroutine  gets  called  In  and  It  In  turn  calls  the  real  print 
program  Into  T  region  since  that  Is  the  only  unoccupied  space 
available  which  Is  large  enough  to  accommodate  the  print 


program. 
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In  conclusion,  let  me  point  out  that  data  storage  Is  a 
function  of  the  number  of  equations  In  the  system  the  code  is 
working  on.  The  storage  of  the  LP  code  Is  Independent  of  the 
problem  and  parts  can  be  reassembled  and  Included  as  they  are 
needed.  If  we  had  a  70^  with  32,000  words  of  core  storage  and 
no  drum,  we  would  use  a  block  of  core  storage  for  the  main 
subroutine  storage.  Any  change  like  this  affects  only  two 
routines,  the  executive  loader  and  the  distributor.  If  we 
get  drums  and  32,000  words  of  core  storage,  we  would  still 
keep  the  subroutines  on  drum  and  use  all  additional  high-speed 
storage  for  transformation  vectors. 


